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TITLE: METHODS OF DETECTING INTERACTIONS BETWEEN 
PROTEINS, PEPTIDES OR LIBRARIES THEREOF USING 
FUSION PROTEINS 

5 INVENTORS: Andrew D. Hamilton, Indraneel Ghosh, and Lynne Regan 

RELATED APPLICATIONS 

[0001] This application claims the benefit of priority of U.S. Provisional Application 
60/203,712, filed on May 12, 2000. 

10 

FIELD OF THE INVENTION 

[0002] The present invention is related to the reassembly of fusion peptides into a 
functionally active protein complex. Specifically, the present invention provides a 
method of forming peptide complexes that associate through the combination of helical 
15 domains to form an antiparallel leucine zipper. The present invention is also related to 
the use of assays to investigate protein-protein interactions. The assays of the present 
invention involve the association of fusion proteins comprising GFP fragments and 
heterologous polypeptides into functionally active GFP that exhibits fluorescence. 

20 BACKGROUND OF THE INVENTION 

[0003] All publications and patent applications herein are incorporated by reference to 
the same extent as if each individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 

25 Green Fluorescent Protein 

[0004] Green fluorescent protein (GFP), a relatively small protein comprising 238 
amino acids, is the ultimate source of fluorescent light emission in the jellyfish Aequorea 
victoria. The gene for GFP was first cloned by Prasher et al (1992, Gene, 1 1 1:229-233), 
and cDNA for the protein produces a fluorescent product identical to that of native 

l-WA/1 61 1537.1 



44574-5100-US 



2 

protein when expressed in prokaryotic (E. coli) and eucaryotic (C. elegans) cells (Chalfie 
etal, 1994, Science, 263, 802-805). 

[0005] The GFP excitation spectrum shows an absorption band (blue light) maximally 
at 395 nm with a minor peak at 470 nm, and an emission peak (green light) at 509 nm. 
5 The longer-wavelength excitation peak has greater photostability then the shorter peak, 
but is relatively low in amplitude (Chalfie et al, 1994, Science, 263:802-805). The 
crystal structure of the protein and of several point mutants has been solved (Ormo et al, 
1996, Science 273, 1392; Yang et al., Nature Biotechnol 14, 1246). The fluorophore, 
consisting of a tripeptide at residues 65-67, is buried inside a relatively rigid beta-can 

10 structure, where it is almost completely protected from solvent access. The GFP 
absorption bands and emission peak arise from an internal p-hydroxybenzylidene- 
imidazolidinone chromophore, which is generated by cyclization and oxidation of the 
tripeptide sequence Ser-Tyr-Gly sequence at residues 65-67 (Cody et al, 1993, 
Biochemistry 32:1212-1218). 

15 [0006] GFP fluorescence in procaryotic and eucaryotic cells does not require exogenous 
substrates and cofactors. Accordingly, GFP is considered to have tremendous potential in 
methods to monitor gene expression, cell development, or as an in situ tag for fusion 
proteins (Heim et al, 1994, P.N.A.S. USA, 91,12501-12504). Chalfie and Prasher, WO 
95/07463 (March 16, 1995), describe various uses of GFP, including a method of 

20 examining gene expression and protein localization in living cells. Methods are described 
wherein: 1) a DNA molecule is introduced into a cell, said DNA molecule having DNA 
sequence of a particular gene linked to DNA sequence encoding GFP such that the 
regulatory element of the gene will control expression of GFP; 2) the cell is cultured in 
conditions permitting the expression of the fused protein; and 3) detection of expression 

25 of GFP in the cell, thereby indicating the expression of the gene in the cell. Methods such 
as those described by Chalfie and Prasher are advantageous compared to previously 
reported methods which utilized 3-galactosidase fusion proteins (Silhavy and Beckwith, 
1985, Microbiol. Rev., 49, 398; Gould and Subramani, 1988, Anal Biochem., 175, 5; 
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Stewart and Williams, 1992, J. Gen. Microbiol, 138,1289) or luciferases, in that the need 
to fix cell preparations and/or add exogenous substrates and cofactors is eliminated. 
[0007] GFP is a valuable marker for intracellular protein localization. However, the 
fusion of GFP with structural proteins can alter their properties, resulting in loss of fusion 
5 protein localization, decreased GFP fluorescence or both. The fluorescence of this 
protein is sensitive to a number of point mutations (Phillips, G. N., 1997, Curr. Opin. 
Struct. Biol. 7, 821-27). The fluorescence appears to be a sensitive indication of the 
preservation of the native structure of the protein, since any disruption of the structure 
allowing solvent access to the fluorophoric tripeptide will quench the fluorescence. 

10 Abedi et al. (1998, Nucleic Acids Res., 26, 623-30) have inserted peptides between 

residues contained in several GFP loops. Inserts of the short sequence LEEFGS (SEQ ID 
NO: 9) between adjacent residues at 10 internal insertion sites were tried. Of these, 
inserts at three sites, between residues 157-158, 172-173 and 194-195 gave fluorescence 
of at least 1% of that of wild type GFP. Only inserts between residues 157-158 and 172- 

15 1 73 had fluorescence of at least 10% of wild type GFP. 

Protein Reassembly using Leucine Zipper 
[0008] The unassisted reconstitution of proteins from peptide fragments has been 
demonstrated for several proteins; including ribonuclease (Richards et al, 1959, J. Biol. 

20 Chem. 234, 1459-1465), chymotrypsin inhibitor-2 (Gay et al, 1994, Biochemistry, 33, 
7957 -7963), tRNA synthetases (Shiba et al, 1992, Proc. Natl Acad. Sci. U.S.A., 89, 
1880-1884), and inteins (Southworth, et al, 1998, EMBOJ., 17, 918-926). Protein 
reassembly has thus become an important avenue for understanding enzyme catalysis 
(Richards et al, 1959, J. Biol Chem. 234, 1459-1465), protein folding (Gay et al, 1994, 

25 Biochemistry, 33, 7957 -7963), and protein evolution (Shiba et al, 1992, Proc. Natl 
Acad. Sci. U.S.A., 89, 1880-1884). Recently, assisted protein reassembly or "fragment 
complementation" has been applied to the in vivo detection of protein-protein interactions 
in such systems as dihydro folate reductase (DHFR) (Pelletier et al, 1998, Proc. Natl 
Acad. Sci. U.S.A., 95, 12141-12146; Remy etal, 1999, Proc. Natl Acad. Sci. U.S.A., 96, 
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5394-5399; Pelletier etal, 1999, Nat. Biotechnol, 17, 683-690), ubiquitin (Karimova et 
al, 1998, Proc. Natl. Acad. Sci. U.S.A., 95, 5752-5756; Johnsson et al, 1994, Proc. Natl. 
Acad. Sci. U.S.A., 91, 10340-10344), and 3-galactosidase (Rossi et al, 1997, Proc. Natl. 
Acad. Sci. U.S.A., 94, 8405-8410). These reassembly processes are contingent upon the 
5 proper choice of a dissection site within a protein and can be aided by techniques such as 
limited proteolysis, circular permutation (Baird et al, 1999, Proc. Natl. Acad. Sci. U.S.A., 
96, 11241-11246; Topell etal, 1999, FEBS Lett., 457, 283-289; Zhang etaL, 1993, 
Biochemistry, 32, 12311-12318; Regan, L., 1999, Curr. Opin. Struc. Biol, 9, 494- 
499)and loop insertions (Abedi et al, 1998, Nucleic Acid Res., 26, 623-630; Nobuhide et 

10 al., 1999, FEBS Lett., 453, 305-307). 

[0009] The dissection and subsequent reassembly of a protein from peptidic fragments 
provides an avenue for controlling its tertiary structure and hence its function. Although 
a majority of leucine zippers associate in a parallel fashion, recent examples of both 
naturally occurring and designed antiparallel leucine zippers have appeared in the 

15 literature (Lupas, A., 1996, Trends Biochem. Sc. 21, 375-382; Kohn, W. D. et al, 1997, 
S. J. Biol. Chem. 272, 2583-2586; Bryson, J. W. etal, 1995, Science, 270, 935-941; 
Oakley M. G. et al, 1998, Biochemistry, 37, 12603-12610; Oakley, M. G. et al, 1997, 
Biochemistry, 36, 2544-2548). However, the prior art does not disclose the attachment of 
antiparallel leucine zippers to polypeptide fragments to form fusion proteins for 

20 reassembling the polypeptide fragments into functional proteins. 

[0010] In contrast to parallel zippers, the antiparallel zippers are oriented in an opposite 
direction. Antiparallel Zippers have the advantage of occurring less frequently in natural 
proteins. Thus, antiparallel leucine zippers will interfere to a lesser extent with natural 
cellular proteins than parallel leucine zippers. Antiparallel attachment of leucine zippers 

25 to protein fragments (between a dissected peptide bond of the parent protein) requires a 
shorter amino acid linker region. As shown by the inventors of the present invention, as a 
preferred embodiment, a linker having 4-6 amino acids is sufficient (see Examples). 
Similar attachment of parallel leucine zippers would require > 10 amino acids to span the 
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necessary distance. The long unstructured linkers would be prone to proteolytic cleavage 
and be less stable in in vivo assays. 

[0011] Katz et al (1998, Biotechniques, 25, 298) describe a targeting approach based on 
noncovalent heterodimerization of GFP and cytoplasmic structural proteins using a 
5 leucine zipper designed to form high-affinity heterodimers. The complexes localized 
accurately to specific sites within cells, providing selective fluorescence labeling of 
subcellular structures such as microfilaments or focal contacts. 

Protein-Protein Interaction Assays 

10 [0012] The association and dissociation of proteins is crucial to all aspects of cell 
function. Examples of protein-protein interactions are evident in hormones and their 
respective receptors, in intracellular and extracellular signalling events mediated by 
proteins, in enzyme substrate interactions, in intracellular protein trafficking, in the 
formation of complex structures like ribosomes, viral coat proteins, and filaments, and in 

15 antigen-antibody interactions. Intracellular assays for detection of protein interactions 
and identification of their inhibitors have received wide attention with the completion of 
the human genome sequence. 

[0013] US Patent 5,585,245 discloses a first fusion protein comprising an N-terminal 
subdomain of ubiquitin, fused to a non-ubiquitin protein or peptide and a second fusion 

20 protein comprising a C-terminal subdomain of ubiquitin, fused to the N-terminus of a 

non-ubiquitin protein or peptide. The patent discloses the use of these fusion proteins for 
studying protein-protein interactions. When contacted with one another, provided that the 
non-ubiquitin proteins or peptides interact (bind) with one another, the N- and C-terminal 
ubiquitin subdomains associate to reconstitute a quasi-native ubiquitin moiety which is 

25 recognized and cleaved by ubiquitin-specific proteases. However, this assay requires the 
use of additional cellular factors, such as the ubiquitin-specific proteases, for detection of 
protein-protein interaction. Thus, this assay is not feasible for high throughput screening 
of cDNA libraries. 
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[0014] US Patent 5,362,625 discloses omega-acceptor and omega-donor polypeptides 
(comprising about two-thirds and one-third of the 3-galactosidase molecule amino and 
carboxyl termini, respectively), prepared by recombinant DNA techniques, DNA 
synthesis, or chemical polypeptide synthesis techniques, which are capable of interacting 
5 to form an active enzyme complex having catalytic activity characteristic of 3- 
galactosidase. The patent also describes the use of these polypeptides in enzyme 
complementation assays for qualitative and quantitative determination of a suspected 
analyte in a sample. 

[0015] The yeast two-hybrid system for detecting protein-protein interactions in 

10 Saccharomyces cerevisiae (Fields and Song, 1989, Nature, 340:245-246; U.S. Pat. No. 
5,283,173 by Fields and Song) is well known in the art. This assay utilizes the 
reconstitution of a transcriptional activator like GAL4 (Johnston, 1987, Microbiol. Rev., 
51:458-476) through the interaction of two protein domains that have been fused to the 
two functional units of the transcriptional activator: the DNA-binding domain and the 

15 activation domain. This is possible due to the bipartite nature of certain transcription 
factors like GAL4. Being characterized as bipartite signifies that the DNA-binding and 
activation functions reside in separate domains and can function in trans (Keegan et al, 
1986, Science 23 1 :699-704). The reconstitution of the transcriptional activator is 
monitored by the activation of a reporter gene like the lacZ gene that is under the 

20 influence of a promoter that contains a binding site (Upstream Activating Sequence or 
UAS) for the DNA-binding domain of the transcriptional activator. This method is most 
commonly used either to detect an interaction between two known proteins (Fields and 
Song, 1989, Nature, 340:245-246) or to identify interacting proteins from a population 
that would bind to a known protein (Durfee et al., 1993, Genes Dev., 7:555-569; Gyuris et 

25 al, 1993, Cell, 75:791-803; Harper et al, 1993, Cell, 75:805-816; Vojtek et al, 1993, 

Cell, 74:205-214). Like the ubiquitin system, additional factors are required for detection 
of the protein-protein interaction. Additionally, in the yeast two-hybrid system, the 
protein interaction must occur in the nucleus of the yeast. 
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[0016] WO 98/34120 describes protein fragment complementation assays for detecting 
bimolecular interactions. The assays comprise coexpression of fusion peptides consisting 
of N- and C-terminal fragments of murine DHFR fused to GCN4 leucine zipper 
sequences in E. coli to form colony. Colony formation only occurs when both DHFR 
5 fragments are present and contain leucine-zipper forming sequences. The published 
patent application contemplates the use of the assay to study molecular interactions 
including protein-protein, protein-DNA, protein-RNA, protein-carbohydrate, and protein- 
small molecule interactions, and for screening cDNA libraries for binding of a target 
protein with unknown proteins or libraries of small organic molecules for biological 

10 activity. WO 98/34120 also contemplates the use of GFP in the protein fragment 
complementation assay. However, the published patent application does not suggest 
fusing antiparallel leucine zipper to DHFR or GFP for reconstitution. GCN4 disclosed in 
the published application and routinely used by skilled artisan to reassemble proteins 
especially in the yeast two hybrid system, is a parallel zipper. Antiparallel and parallel 

15 zippers orient proteins in opposite direction; thus, it is not predictable that an antiparallel 
zipper can be substituted for a parallel zipper. 

[0017] Additionally, all protein reassembly strategies disclosed in WO 98/34120 are for 
reassembly of multi domain proteins such as DHFR. The two dissected domains of 
DHFR can fold separately and only need to be brought into close proximity by attached 

20 proteins. There is no precedent for rational dissection of a single domain protein such as 
GFP that can be accomplished based upon the WO 98/34120. WO 98/34120 does not 
teach how to rationally dissect single domain proteins that can be subsequently 
reassembled. Finally, the ability to identify and characterize appropriate sites for 
dissecting a single domain protein is not validated or demonstrated in WO 98/34120. 

25 [0018] US Patent 6,180,343 relates to the use of fluorescent proteins, particularly green 
fluorescent protein (GFP), in fusion constructs with random and defined peptides and 
peptide libraries, to increase the cellular expression levels, decrease the cellular 
catabolism, increase the conformational stability relative to linear peptides, and to 
increase the steady state concentrations of the random peptides and random peptide 
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library members expressed in cells for the purpose of detecting the presence of the 
peptides and screening random peptide libraries. The patent does not contemplate the use 
of antiparallel leucine zipper for reconstituting GFP nor the use of peptides that associate 
with each other to reconstitute GFP and to provide a detection signal. 

5 

SUMMARY OF THE INVENTION 

[0019] The present invention provides protein complexes comprising a first and second 
peptide, each of said peptides being joined, operably linked, or fused to a heterologous 
helical domain, said helical domains being noncovalently associated to form an 

10 antiparallel leucine zipper. The peptides of the protein complexes form a functional 
signaling moiety such as a reporter, a marker, or a biosensor upon non-covalent 
association of the helical domains into an antiparallel leucine zipper. In one embodiment, 
each of the peptides is joined to a helical domain via a linker. In a preferred embodiment, 
each of the helical domains comprises an amino acid sequence as set forth in SEQ ID NO: 

15 1 or SEQ ID NO: 2. Preferably, each of the first and second peptides comprises a distinct 
portion of green fluorescent protein (GFP). 

[0020] In one aspect, the present invention provides fusion proteins comprising a 
peptide and a helical domain, said helical domain forming an antiparallel leucine zipper 
when it noncovalently associates with a complementary helical domain. The helical 

20 domain is a heterologous or distinct protein or polypeptide fragment, relative to the 
peptide of the fusion protein. The fusion protein may further comprise a linker moiety 
interposed between the peptide and the helical domain. In a preferred embodiment, the 
peptide comprises a peptide derived from green fluorescent protein (GFP). 
[0021] In another aspect, the present invention provides nucleic acids encoding fusion 

25 proteins comprising a peptide and helical domain, said helical domain forming an 

antiparallel leucine zipper when it noncovalently associates with a complementary helical 
domain. 

[0022] The present invention provides a method of assembling a protein complex 
comprising (a) providing first and second helical domains that non-covalently associate 
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to form an antiparallel leucine zipper; (b) providing first and second peptides; (c) 
producing fusion proteins by separately fusing said first helical domain to said first 
peptide and said second helical domain to said second peptide; and, (d) allowing the 
fusion proteins to form a protein complex mediated by the non-covalent association of the 
5 first and second helical domains into an antiparallel leucine zipper. The first and second 
peptides are distinct peptides. Preferably, they are distinct peptides derived from GFP, 
such that they comprise different GFP fragments. 

[0023] In one embodiment of the disclosed method of assembling a protein complex, 
the protein complex comprises a signaling moiety and the helical domains comprise a 

10 leucine rich hydrophobic core. The helical domains may further comprise acidic residues 
and basic residues. The helical domains may further comprise a buried asparagine 
residue. The pair of helical domains preferably have the amino acid sequences as set 
forth in SEQ ID NO: 1 and SEQ ID NO: 2. In an alternative embodiment of the method, 
the step of producing the fusion proteins further comprises interposing a linker moiety 

15 between the peptide and the helical domain. 

[0024] The present invention also provides a method of identifying a polypeptide that 
interacts with a known polypeptide comprising (a) producing a first fusion protein 
comprising the known polypeptide linked to a first GFP fragment; (b) producing a second 
fusion protein comprising a test polypeptide linked to a second GFP fragment, wherein 

20 association of the first and second GFP fragments results in a GFP that exhibits detectable 
fluorescence; (c) allowing the first fusion protein to associate with the second fusion 
protein to form a complex mediated by the non-covalent association of the known 
polypeptide and test polypeptide; and, (d) detecting whether, or to what extent, 
association of first and second GFP fragments occcurs, wherein association of GFP 

25 indicates that the test polypeptide interacts with the known polypeptide. Preferably, the 
first GFP peptide is NGFP and the second GFP peptide is CGFP. 
[0025] In one aspect, the present invention provides a method of identifying a 
polypeptide that interacts with a known polypeptide comprising (a) producing a nucleic 
acid encoding a fusion protein comprising the known polypeptide linked to a first GFP 
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fragment; (b) producing a plurality of nucleic acids encoding fusion proteins comprising a 
test polypeptide linked to a second GFP fragment, wherein association of the first and 
second GFP fragments results in a GFP that exhibits detectable fluorescence; (c) 
cotransforming or cotransfecting the nucleic acids of steps (a) and (b) into a host cell for 
5 expression of the encoded fusion proteins; (d) selecting colonies that exhibit fluorescence; 
and, (e) culturing the selected colonies to identify the test polypeptides that interact with 
the known polypeptide. 

[0026] In a preferred embodiment of the constructs and methods of the present 
invention, the first GFP peptide is NGFP and the second GFP peptide is CGFP. Also, 
10 preferably, the nucleic acids of step (b) of the foregoing identification step are produced 
in the form of a combinatorial library. 

[0027] In another aspect, the present invention provides a method of identifying a 
molecule that inhibits the activity of a known protein comprising (a) producing a first 
fusion protein comprising a first known polypeptide linked to a first GFP fragment; (b) 

15 producing a second fusion protein comprising a second polypeptide linked to a second 
GFP fragment, wherein the second polypeptide is known to interact with the first 
polypeptide and wherein association of the first and second GFP fragments results in a 
GFP that exhibits detectable fluorescence; (c) allowing the first fusion protein to associate 
with the second fusion protein to form a GFP complex mediated by the non-covalent 

20 association of the first and second polypeptide; (d) incubating a test molecule with the 
GFP complex; and, (e) detecting disassembly of the complex, wherein disassembly of the 
complex indicates that the test molecule inhibits the activity of the known protein. 
Preferably, the first GFP peptide is NGFP and the second GFP peptide is CGFP. 
[0028] The present invention also contemplates a method of detecting protein-protein 

25 interactions comprising (a) producing a first fusion protein comprising a known 
polypeptide linked to a first GFP fragment; (b) producing a second fusion protein 
comprising a test polypeptide linked to a second GFP fragment, wherein association of 
the first and second GFP fragments results in a GFP that exhibits detectable fluorescence; 
(c) allowing the first fusion protein to associate with the second fusion protein to form a 
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complex mediated by the non-covalent association of the known polypeptide and test 
polypeptide; and, (d) detecting reassembly of GFP, wherein reassembly of GFP indicates 
that the test polypeptide interacts with the known polypeptide. 

[0029] A related method may further comprise obtaining nucleic acids encoding the first 
5 and second fusion proteins and cotransfecting or cotransforming the nucleic acids into a 
cell to obtain the first and second fusion protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0030] Figure 1 shows the strategy for antiparallel leucine zipper directed protein 
10 reassembly of GFP (Kraulis, P. J., 1991, J. Appl Crystallog., 24, 946-950). Both the 
ribbon and topographical structures are depicted: GFP is shown in green, NZGFP is 
shown in blue and CZGFP is shown in red. The sequences of the designed leucine 
zippers, NZ and CZ, are ALKKELQANKKELAQLKWELQALKKELAQ (SEQ ID NO: 
1) and EQLEKKLQALEKKLAQLEWKNQALEKKLAQ (SEQ ID NO: 2) respectively. 

15 

[0031] Figure 2 shows fluorescence binding isotherm for the interaction of NZGFP 
with CZGFP monitored at 505 nm. Inset shows the normalized fluorescence excitation 
and emission of the reconstituted NZGFP.CZGFP complex. 

20 [0032] Figure 3 shows in vitro reconstitution of GFP demonstrated by (a) green 

fluorescent BL21(DE3) cells and the corresponding SDS gels of (b) lanel: MW markers; 
lane 2: protein from cotransformed green colony; and lane 3: protein from colony 
containing only NZGFP plasmid and (c) lanel: MW markers; protein from cotransformed 
green colony; and lane 3: protein from colony containing only CZGFP plasmid. 

25 

[0033] Figure 4 shows the antiparallel leucine zipper pairs attached to CGFP and NGFP 
are shown in helical wheel representations. The Lys (K) residues are colored blue and 
complementary Glu (E) residues are colored red. The pairs a and b are electrostatically 
matched and the pairs c and d are electrostatically mismatched. The inset shows restreaks 
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of single Escherichia coli colonies corresponding to each pair. EK-CGFP is the same as 
CZGFP, and EK-NGFP is the same as NZGFP. 

[0034] Figures 5 A-C show fluorescence based selection. A. The "prey" leucine zipper 
5 attached to CGFP is randomized (X) at the e and g positions of the helix with either Lys 
(K) or Glu (E). and the "bait" leucine zipper attached to NGFP contains only Glu (E) 
residues at both e and g positions. B. SDS gel showing protein expression profiles of 10 
cotransformed green fluorescent colonies from the selection. C. Tabulation of the 
residues selected in the library leucine zipper (XX-CGFP) by screening for fluorescence 
10 of cotransformed Escherichia coli cells. 

[0035] Figure 6 shows fluorescence spectra for the dominant negative inhibition of 
NGFP/CGFP complex formation as a function of added EK leucine zipper inhibitor (SEQ 
ID NO: 1). Inset shows the inhibition profiles of the EK leucine zipper, NGFP, and 
15 CGFP peptides. 

DETAILED DESCRIPTION 

1. General Description 
[0036] The present invention is based on the finding that the dissection and subsequent 
20 reassembly of a protein from peptidic fragments provides an avenue for controlling the 
protein's tertiary structure and hence its function. 

[0037] The present invention is based in part on the surprising discovery of a general 
method for the reassembly of protein fragments mediated by the non-covalent association 
of antiparallel leucine zippers (Lupas, A., 1996, Trends Biochem. Sc. 21, 375-382; Kohn, 
25 W. D. et al. 9 1997, J. Biol. Chem. 272, 2583-2586; Bryson, J. W. et al, 1995, Science, 

270, 935-941). Specifically, the present invention discloses a strategy for the noncovalent 
reconnection of the N- and C- termini of a dissected surface loop of a protein by means of 
antiparallel leucine zippers (Figure 1) (Kraulis, P. J., 1991, J. Appl Crystallog., 24, 946- 
950). The present invention demonstrates the successful application of this 
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oligomerization strategy, both in vitro and in vivo, to the 238 residue green fluorescent 
protein (GFP) from Aequorea victoria (Tsien, R. Y., 1998, Annu. Rev. Biochem., 67, 
509-544). GFP provides an easily testable system for correct reassembly by virtue of its 
autocatalytically generated fluorescence, which is intimately linked to its properly folded 
5 structure (Ormo, M. et al., 1996, Science, 273, 1392-1395; Reid, B. G. et al, 1997, 
Biochemistry, 36, 6786-6791; Miyawaki, A. et al, 1997, Nature, 388, 882-887; 
Miesenbock, G. etal, 1998, Nature, 394, 192-195;). 

[0038] The present invention is also based in part on the discovery of an effective 
strategy involving linking fragments of an enzyme to potentially interacting protein- 
ic) partners such that functional enzyme reassembly only occurs on formation of a strong 

protein-protein complex. In one aspect, the present invention establishes the selectivity of 
the GFP reassembly mediated selection of interacting proteins (GRIP) assay and applies it 
to the in vivo colorimetric selection of complementary leucine zipper pairs from 
combinatorial libraries in Escherichia coli. In another aspect, the present invention 
15 demonstrates the applicability of the GRIP assay to monitor the disruption of protein- 
protein interactions by a dominant negative approach. Accordingly, the present invention 
provides an assay system that has the potential to monitor protein-protein interactions in 
their natural environment within a cell and are not limited to the nucleus as are classic 
yeast two-hybrid systems (Fields, S. et al, 1989, Nature, 245-246). 

20 

2. Definitions 

[0039] As used herein, "active protein complex" refers to a protein complex comprising 
two or more peptides and retaining substantially all the functional activity of the native 
protein from which the peptides are obtained. 
25 [0040] As used herein, "covalent bond" refers to an interatomic bond characterized by 
sharing of electrons. 

[0041] As used herein, "fusion protein" or "chimeric protein" refers to a hybrid protein, 
which consists of two or more proteins, or fragments thereof, linked together covalently. 
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A fusion protein may comprise two or more peptides or proteins from different animals, 
origins, or species. 

[0042] As used herein, "helical domain" refers to a protein or polypeptide fragment or a 
peptide having a V helix or a coiled configuration. 
5 [0043] As used herein, "heterologous protein or peptide" refers to a protein or peptide 
derived from a different origin, animal, or species. Heterologous proteins or peptides are 
not operably linked in their naturally occurring or native form. 

[0044] As used herein, "noncovalent association" refers to molecular interactions that 
do not involve an interatomic bond. Noncovalent interactions involve, for example, ionic 
10 bonds, hydrogen bonds, hydrophobic interactions, and van der Waals forces. 

Noncovalent forces may be used to hold separate polypeptide chains together in proteins 
or in protein complexes. 

[0045] As used herein, "protein complex" refers to a combination of two or more 
proteins into a larger molecule without covalent bonding. 
15 [0046] As used herein, "random peptide" refers to an oligomer composed of two or 
more amino acid residues and constructed by a means with which one does not preselect 
the complete sequence of a particular oligomer. 

[0047] As used herein, "random peptide library" or a "combinatorial library" refers a 
library comprising not only of a set of recombinant DNA vectors (also called 
20 recombinants) that encodes a set of random peptides, but also of random peptides 
encoded by those vectors, as well as the fusion proteins containing those random 
peptides. 

[0048] As used herein, "signaling moiety" refers to a moiety that acts to cause an action 
such as a signal. The moiety may signal as a result of an enzymatic reaction, light 
25 absorption, or other means. 
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3. Specific Embodiments 

A. GFP as a System for Protein Reassembly and Fragment Complementation 

Assay 

[0049] The present invention is based in part on the use of GFP as model for protein 
5 reassembly and fragment complementation based assays. GFP provides an ideal system 
for these assays because the reassembled protein autofluoresces and is easily visualized 
and amenable to fluorescence activated cell sorting (Tsien, R. Y., 1998, Annu. Rev. 
Biochem. 67, 509-544; Misteli, T. et al, 1997, Nat. Biotechnol. 15, 961-964). GFP 
fluorescence does not require the addition of other cellular factors, substrates, or 

10 additional gene products from A. victoria. Moreover, GFP can be expressed and detected 
in various cells and organisms and is not localized to a specific organelle of a cell upon 
expression. Additionally, unlike the DHFR assay, detection of GFP expression is not 
dependent upon survival or death of host cells. Nor is the expression of GFP dependent 
upon the addition of cofactors as in the 3-galactoside assay or of other cellular 

15 components as in the ubiquitin assay. It is also not toxic to mammals and has been 
expressed in monkeys (Chan et aL, 2001, Science, 291, 309). Further, the multiple 
variants of GFP available for use in different organisms and cell-types makes it an ideal 
protein candidate for development of a general assay such as the GRIP assay described 
below. 

20 [0050] Various mutations in GFP leading to brighter emission following 488 nm 

excitation have been generated. Mutations in GFP which shift the excitation maximum 
from 395 nm to about 490 nm have been reported by Delagrave et al. (1995, 
Biotechnology, 13,151) and Heim et al (1995, Nature, 373, 663). Mutants with Ala, 
Gly, He, Cys or Thr substituted for Ser65 have large shifts in excitation maxima, and 

25 fluoresce more intensely than wild-type protein when excited at 488 nm. The mutation of 
Ser65 to Thr or Cys has been observed to increase by a factor of 6 the fluorescence of 
GFP following 488 nm excitation. Heim et. al. (1994, Proc. NaVlAcad. Sci USA, 91, 
12501-12504) describe a mutant that fluoresces blue and contains a histidine in place of 
Tyr66. Delagrave et al. (1995, Bio. Technology, 13, 151-154) report on several Aequorea 
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GFP variants that showed red-shifted excitation spectra, i.e., shift in excitation maxima 
from 393 nm to 498 nm. Delagrave et al. hypothesize that co-expression of GFP and red- 
shifted GFP (RSGFP) will enable the analysis of two proteins or promoters per cell or 
organism. 

5 [0051] U.S. Patent 6,096,865 describes GFP mutants with improved solubility 

properties at higher temperatures and are able to fluoresce at 37 °C. Specifically, the 
patent provides a GFP mutant in which phenylalanine at original amino acid position 64 
is replaced by a leucine. This mutant has the ability to fluoresce at 37 °C. Other mutants 
with altered spectra are disclosed by Heim et al. (1994, Proc. Nat'l Acad. Sci USA, 91, 
10 12501-12504 and 1995, Nature, 373, 663). 

[0052] The present invention contemplates the use of various GFP mutants in the 
protein complementation assay and protein reassembly assay described in detail below. 
The preferred GFP mutant is the sglOO GFP variant described below. 

15 B. Methods for Reassembly of Fragments into a Functional Protein 

[0053] The present invention also is based in part on the discovery that an antiparallel 
leucine zipper is useful for in vitro reassembly of protein fragments into a functionally 
active protein. Specifically, a GFP variant (sglOO) which has a single excitation and 
emission maximum at 475 nm and 505 nm respectively, was dissected and refolded using 

20 an antiparallel leucine zipper. The GFP variant, sglOO, was dissected at a surface loop 
between residues 157 and 158. A pair of helices, NZ and CZ (SEQ ID NO: 1 and 2), 
capable of forming an antiparallel leucine zipper was designed and fused to the dissected 
GFP fragments via linkers to form NZGFP (N-terminal GFP) and CZGFP (C-terminal). 
Under conditions routinely used for folding denatured GFP, NZGFP and CZGFP 

25 reassembled properly to form a functionally active GFP. The wavelengths, 8 max , for 
fluorescence excitation and emission spectra were identical to that of the parent GFP 
(Figure 2). 

[0054] The present invention is also based in part on the discovery that an antiparallel 
leucine zipper is useful for in vivo reassembly of protein fragments into a functionally 
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active protein. Specifically, equimolar amount of plasmids encoding NZGFP and CZGFP 
were transformed into E. coli cells. Colonies that turned green (Figure 3, panel a) were 
selected and further cultured in liquid media for analysis of the protein expression pattern. 
As shown in figure 3, panels b and c, the green colonies expressed similar amounts of 
5 NZGFP and CZGFP, whereas the non- fluorescent colonies contained either NZGFP or 
CZGFP. Moreover, control cotransformation experiments with NGFP/CGFP, 
NGFP/CAFP, and NZFP/CGFP did not have any green colonies. Accordingly, the 
presence of both NZ and CZ leucine zippers are required to mediate GFP assembly in 
vivo and in vitro. 

10 [0055] The present invention contemplates the use of the antiparallel leucine zipper to 
refold, reconstitute, or reassemble proteins from peptides. Moreover, the ability to 
reconstitute GFP from its peptide fragments can be extended to an in vivo fragment 
complementation assay for the selection of antiparallel leucine zippers as has been 
demonstrated for parallel leucine zippers with DHFR (Pelletier, J. N. et al 9 1999, Nat. 

15 BiotechnoL, 17, 683-690). As described below, fragmented GFP can be used to study the 
in vivo interaction of protein-protein pairs which have their N and C termini in close 
proximity (Pelletier, J. N. et al, 1998, Proc. Natl. Acad. Sci. U.S.A., 95, 12141-12146). 
More generally, the protein reassembly strategy of the present invention may have 
applications such as the selective isotopic labeling of one fragment of a large protein for 

20 NMR analysis, or the mutagenesis of a limited region of a protein as demonstrated for 
inteins (Cotton, G. J. et al, 1999, J. Am. Chem. Soc, 121, 1 100-1 101; Cotton, G. J. et aL, 
1999, Chem. Biol, 6, R247-R256; Muir, T. W. etal, 1998, Proc. Natl. Acad. Sci. U.S.A., 
95, 6705-6710; Xu, R. etal, 1999, Proc. Natl Acad. Sci. U.S.A., 96, 388-393). Further, 
the engineering of an on/off switch for the activity of fragmented proteins by designing a 

25 leucine zipper heterodimer which can be reversibly assembled or disassembled by 
controlling the environmental conditions is also contemplated (Zutshi R. et al, 1998, 
Curr. Opin. Chem. Biol, 2, 62-66; Yao, S. etal, 1998, Nature, 396, 447-450; Krylov, D. 
etal, 1994,EMB0J., 13,2849-2861). 



30 
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C. GRIP Assay and Combinatorial Selection 
[0056] The present invention is also based on the selectivity of the GFP reassembly 
mediated selection of interaction proteins. Based on this selectivity, the present invention 
developed the GRIP assay (GFP reassembly mediated selection of interacting proteins or 
5 peptides) and applied the assay to the in vivo colorimetric selection of complementary 
leucine zipper pairs from combinatorial libraries. 

[0057] Specifically, the inventors having established that the GRIP assay was selective 
for high affinity LZ (leucine zipper) pairs, tested the applicability of the assay in the 
combinatorial selection of LZ pairs that would interact strongly enough to promote GFP 

10 reassembly (Figure 5 A). This would extend the GRIP system for selection of protein 
partners as had been demonstrated for other fragment reassembly systems (Pelletier, J.N. 
et ai, 1999, Nat. Biotechnol. 17, 683-690). A simple experiment in which the acidic LZ 
containing N-terminal GFP fragment (EE-NGFP) was kept constant was chosen. A 
library of LZ partners that could either code for Glu or Lys with equal probability at the e 

15 and g "specificity" positions (Figure 5 A) was generated. This library was fused to the C- 
terminal GFP fragment (XX-CGFP). The plasmid encoded library of XX-CGFP and EE- 
NGFP were cotransformed into host cells, and colonies that exhibited fluorescence were 
selected and analyzed by sequencing. As expected, there was an overall enrichment of 
Lys residues as the selected parner for complementing the acidic EE-NGFP. The 

20 electrostatic pairing of Lys/Glu is required for stabilizing the leucine zipper. 

[0058] The present invention demonstrates that the GRIP assay is selective for specific 
protein pairs in vivo and is amenable for the selection of complementary protein pairs in 
vivo. 

25 D. GRIP Assay and its Use in Detection of Inhibitors or Protein-Protein 

Interactions 

[0059] The present invention is further based in part on the discovery that the GRIP 
assay is useful for assaying the disruption of protein-protein interactions in vitro. The 
GRIP can be utilized for identifying inhibitors of protein-protein interactions. 
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Specifically, a LZ peptide (SEQ ID NO: 1) was incubated with NGFP/CGFP complex. 
The sample was monitored for fluorescence as a function of added peptide (Figure 6). 
The LZ peptide (SEQ ID NO: 1) prevented the assembly of the complex (4 :M) with an 
IC 50 value of 3 1 :M. Control experiments with addition of either NGFP or CGFP 
5 fragments that lacked leucine zippers did not prevent reassembly of NZGFP/CZGFP 
complex (Figure 6). 

E. Applications of the GRIP Assay 
[0060] The present invention is based on the development of a visually detectable 

10 colorimetric system for studying the assembly and disassembly of protein partners. This 
system can be used for high-throughput screening, for example, screening using 
fluorescence activated cell sorting in yeast (Winson, M. K. et ai, 2000, Methods, 21, 231- 
240 (2000).). Further, the system can be practiced using protein three-hybrid detection 
system, with two interacting proteins fused to respective fragments of a donor GFP 

15 variant and a third protein fused to an acceptor GFP variant, thus allowing for in vivo 
fluorescence resonance energy transfer measurements (Tsien, R. Y., 1998, Annu. Rev. 
Biochem., 67, 509-544; Pollok, B. A. etal., 1999, Trends Cell Biol., 9, 57-60). 
[0061] In the GRIP assay, the emitted light can be analyzed by visual screening, a flow 
sorter (FACS), a spectrophotometer, a microtiter plate reader, a charge coupled devise 

20 (CCD) array, a fluorescence microscope, or other similar devices. 

[0062] The GRIP assay may be performed in using a multiwell format. Typically, wells 
are arranged in two dimensional linear arrays with greater than 864 wells on a standard 
microtiter plate footprint. Other commonly used numbers of wells include 1536, 3456, 
and 9600. Well volumes typically vary from 500 nanoliters to over 200 microliters, 

25 depending on well depth and cross sectional area. Well volumes of 1, 2, 5, 10, 20, and 50 
microliters are commonly used. Wells can be made in any cross sectional shape (in plan 
view) including, square, round, and hexagonal and combinations thereof. Wells can be 
made in any cross sectional shape (in vertical view), including shear vertical walls with 

l-WA/1611537.1 



44574-5100-US 



20 



flat or round bottoms, conical walls with flat or round bottoms and curved vertical walls 
with flat or round bottoms and combinations thereof. 

[0063] US Patent 6,229,603 provides multi-well plates with greater than 864 wells that 
comprise a layer of cycloolefin having low fluorescence and high transmittance. These 
5 multi-well plates are particularly well suited for fluorescence measurements. 

[0064] The GRIP assay may be used to study protein-small molecule interactions. 
Alternatively, the assay may be used to investigate protein-protein interactions and to 
screen libraries for identification of binding molecules. Examples of protein-protein 
interactions include, but are not limited to, antigen/antibody, ligand/receptor, antagonist 

10 or inhibitor/protein, binding protein/protein, and enzyme/substrate. 

[0065] Further, the GRIP assay may be used to investigate other macromolecular 
interactions. A known DNA or RNA binding protein, "A" (that binds a RNA or DNA 
sequence "X"), is fused to one fragment of GFP, for example NGFP, and a second 
putative RNA or DNA binding protein from library "Z" is fused to, for example CGFP. 

15 In an in vivo or in vitro system the DNA or RNA component ("X-Y") that is being 

assayed for will have the DNA or RNA sequence "X" attached to a second DNA or RNA 
sequence Y whose protein target is being sought from library "Z" When binding or 
complexing occurs between "X" and A-NGFP and "Y" with a protein from the X-CGFP 
library, fluorescence will be established. 

20 [0066] Variations upon this can be used to identify carbohydrate-protein partners or 
small molecule protein partners by making appropriate changes in the NGFP fused 
protein A (which can be chosen to bind carbohydrate or small molecule components). 
[0067] This assay may also be used to investigate libraries of DNA, RNA, 
carbohydrates, peptides or other small molecules. In this situation "X-Y" can be a 

25 library. "X" is held constant with a known DNA, RNA, carbohydrate, or small molecule 
that binds a protein, "A", and "Y" can be varied as desired. The fusion proteins A-NGFP 
and Z-CGFP can also be held constant. "Y" is to identified and is a molecule that binds 
Z-CGFP. Establishing fluorescence will indicate identification of a DNA, RNA, 
carbohydrates, or small molecules component Y that binds protein Z. 
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F. Combinatorial Libraries 
[0068] A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis, by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial 
5 chemical library such as a polypeptide library is formed by combining a set of chemical 
building blocks (amino acids) in every possible way for a given compound length (i.e., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks. 
[0069] Preparation and screening of combinatorial chemical libraries is well known to 

10 persons of skill in the art. Such combinatorial chemical libraries include, but are not 
limited to, peptide libraries (see, e.g., U.S. Pat. No. 5,010,175, Furka, 1991, Int. J. Pept. 
Prot. Res. 37, 487) and Houghton et al, 1991, Nature 354, 84). Other chemistries for 
generating chemical diversity libraries can also be used. Such chemistries include, but 
are not limited to: peptoids (PCT Publication No. WO 91/19735), encoded peptides (PCT 

15 Publication WO 93/20242), random bio-oligomers (PCT Publication No. WO 92/00091), 
benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as hydantoins, 
benzodiazepines and dipeptides (Hobbs etal 9 1993, Proc. Nat. Acad. ScL USA 90, 6909), 
vinylogous polypeptides (Hagihara et al, 1992, Amer. Chem. Soc. 1 14, 6568), 
nonpeptidal peptidomimetics with 3-D-glucose scaffolding (Hirschmann et al, 1992, J. 

20 Amer. Chem. Soc. 1 14, 9217), analogous organic syntheses of small compound libraries 
(Chen et al, 1994, J. Amer. Chem. Soc. 1 16, 2661), oligocarbamates (Cho et al, 1993, 
Science 261, 1303), and/or peptidyl phosphonates (Campbell et al, 1994, J. Org. Chem, 
59, 658), nucleic acid libraries, peptide nucleic acid libraries (U.S. Pat. No. 5,539,083), 
antibody libraries (Vaughn et al, 1996, Nature Biotechnology 14(3), 309 and 

25 PCT/US96/10287), carbohydrate libraries (Liang et al, 1996, Science 274, 1520 and U.S. 
Pat. No. 5,593,853), small organic molecule libraries (benzodiazepines, Baum, C&EN 
January 18, page 33 (1993); isoprenoids, U.S. Pat. No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Pat. No. 5,549,974; pyrrolidines, U.S. Pat. Nos. 5,525,735 and 
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5,519,134; morpholino compounds, U.S. Pat. No. 5,506,337; benzodiazepines, U.S. Pat. 
No. 5,288,514, and the like). 

[0070] Devices for the preparation of combinatorial libraries are commercially available 
(see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville Ky., Symphony, Rainin, 
5 Woburn, Mass., 433A Applied Biosystems, Foster City, Calif., 9050 Plus Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, 
Pa., Martek Biosciences, Columbia, Md., etc.). 

10 [0071] The small molecules of a small molecule combinatorial library may be selected 
from at least one of the group consisting of amino acids, peptides, oligonucleotides, and 
heterocyclic compounds. The present invention contemplates contemplates combinatorial 
libraries of small molecules that are naturally occurring or synthetic. 
[0072] Suitable peptides comprise as few as two amino acids to as many as about 30; 

15 preferably, suitable peptides comprise from about two amino acids to about fifteen; most 
preferably, suitable peptides comprise from about two amino acids to about ten. Any 
amino acid may be incorporated into peptides screened and identified using the present 
invention, including any combination of the naturally occurring proteinogenic amino 
acids as well as amino acids not naturally occurring in proteins such as, but not limited to, 

20 dextrorotatory forms of the known amino acids, for example. 

[0073] Suitable oligonucleotides consist of as few as two nucleotides to as many as 
about 50; preferably, suitable oligonucleotides consist of from about five nucleotides to 
about 30; most preferably, suitable oligonucleotides consist of from about five 
oligonucleotides to about 15. Any nucleotide may be incorporated into an 

25 oligonucleotide to be screened and identified using the present invention, including any 
combination of the naturally occurring deoxyribonucleotides and ribonucleotides as well 
as those not naturally occurring in biological systems, such as, but not limited to, H- 
phosphonate derivatives,JN-blocked-5'-0-DMT-deoxynucleoside 3 -(2-cyanoethyl-N,N- 
diisopropyl)phosphoramidites, N-blocked-5-O-DMT-deoxynucleoside 3 ! -(2-cyanoethyl- 
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N,N-diisopropyl)phosphoramidites, N-blocked-5'-0-DMT-deoxynucleoside 3 f -(methyl- 
N,N-diisopropyl) phosphoramidites, N-blocked-5'-0-DMT-deoxynucleoside 3'-(2- 
chlorophenyl) phosphates, N-blocked-5'-0-DMT-deoxynucleoside 3'-(2-chlorophenyl 2- 
cyanoethyl) phosphate, all of which are nucleoside derivatives used in oligonucleotide 
5 synthesis. 

[0074] Suitable heterocyclic compounds consist of, at minimum, a single four 
membered ring to as much as a multiple of four membered or greater membered rings 
coupled by carbon chains of 1 to about 20 atoms in length, such chains being saturated or 
not. Preferably, suitable heterocyclic compounds include a single four- to seven- 

10 membered ring, as well as, but not limited to varying combinations of 5, 6, or 7 

membered rings having varying numbers of N, S, or O atoms. Examples of suitable 
heterocyclic compounds include benzodiazepine and derivatives thereof (as, for example, 
disclosed in Bunin et al, 1992, J. Am. Chem. Soc. 1 14, 10997),_penicillins, 
cephalosporins, and folate derivatives. 

15 [0075] For ease of identification, the molecules in a small molecule combinatorial 
library may be tagged for decoding their identity. 

[0076] The GRIP assay may be used to screen mixed libraries. Mixed libraries of small 
molecules comprising amino acids, peptides, oligonucleotides, and heterocyclic 
compounds that are S'-hydroxyl derivatives of the oligonucleotides may be used. The 

20 peptide end of members of a peptide library can be modified to include a carboxyl group. 
A process of esterification of the carboxyl group with the 5'-hydroxyl of the 
oligonucleotide is used to produce a mixed library containing peptide-oligonucleotide 
species. Brenner et ah, (1992, Proc. Nat'lAcad. Sci. USA 89, 5381) also describes a 
method of preparation of mixed libraries having nucleotides and peptides. A mixed 

25 library comprising a heterocyclic compound and a peptide is also prepared by the reaction 
of suitable functional groups present on the heterocyclic compound. For instance, the 
carboxyl group on a heterocyclic compound is reacted with the amino group on the 
peptide to provide an amide linkage. 
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[0077] Preferably, the GRIP assay of the present invention is used to screen peptide 
libraries. A comprehensive review of various types of peptide libraries can be found in 
Gallop^/., 1994,/. Med. Chem. 37:1233-1251. The use of peptide libraries is well 
known in the art. Peptide libraries have generally been constructed by one of two 
5 approaches. 

[0078] In the first approach, peptides have been chemically synthesized in vitro in 
several formats. For example, Fodor et al (1991, Science 251, 767) describe use of 
complex instrumentation, photochemistry and computerized inventory control to 
synthesize a known array of short peptides on an individual microscopic slide. Houghten 

10 et al (1991, Nature, 354, 84) describe mixtures of free hexapeptides in which the first 
and second residues in each peptide were individually and specifically defined. Lam et 
al. (1991, Nature 354, 82) describe a "one bead, one peptide" approach in which a solid 
phase split synthesis scheme produced a library of peptides in which each bead in the 
collection had immobilized thereon a single, random sequence of amino acid residues. 

15 [0079] In the second approach, peptides are expressed in biological systems as either 
soluble fusion proteins or viral capsid fusion proteins. A number of peptide libraries have 
been generated using the Ml 3 phage. Ml 3 is a filamentous bacteriophage that has been 
routinely used in molecular biology laboratories for the past 20 years. Ml 3 viral particles 
consist of six different capsid proteins and one copy of the viral genome, as a single- 

20 stranded circular DNA molecule. Once the Ml 3 DNA has been introduced into a host 
cell such as E. coli, it is converted into double-stranded, circular DNA. The viral DNA 
carries a second origin of replication that is used to generate the single-stranded DNA 
found in the viral particles. During viral morphogenesis, there is an ordered assembly of 
the single-stranded DNA and the viral proteins, and the viral particles are extruded from 

25 cells in a process much like secretion. The Ml 3 virus is neither lysogenic nor lytic like 
other bacteriophage (e.g., 8); cells, once infected, chronically release virus. This feature 
leads to high titers of virus in infected cultures, i.e., 10 12 pfu/ml. 
[0080] In a preferred embodiment, a GFP peptide comprising a fragment of GFP is 
fused to a random peptide to form a fusion polypeptide. By "fused" or "operably linked" 
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herein is meant that the random peptide and the GFP, are linked together, in such a 
manner as to minimize the disruption to the stability of the GFP structure, Le. it retains 
fluorescence. The GFP fusion polypeptide of the present invention can comprise further 
components such as linkers or fusion partners. 
5 [0081] The peptides (and nucleic acids encoding them) are randomized, either fully 

randomized or they are biased in their randomization, e.g. in nucleotide/residue frequency 
generally or per position. As used herein "fully randomized" means that each nucleic 
acid and peptide consists of essentially random nucleotides and amino acids, respectively. 
The nucleic acids which give rise to the peptides are chemically synthesized, and thus 

10 may incorporate any nucleotide at any position. Thus, when the nucleic acids are 

expressed to form peptides, any amino acid residue may be incorporated at any position. 
The synthetic process can be designed to generate randomized nucleic acids, to allow the 
formation of all or most of the possible combinations over the length of the nucleic acid, 
thus forming a library of randomized nucleic acids. 

15 [0082] Alternatively, the peptide library is biased. In this case, some positions within 
the sequence are either held constant, or are selected from a limited number of 
possibilities. Individual residues may be fixed in the random peptide sequence to create a 
structural bias. For example, proline or bulky residues such as W, R, K, L, I, V, F or Y 
may be inserted to restrict the conformation of the peptide. Also, the library can be 

20 biased to a particular secondary structure such as the alpha-helical structure. Examples of 
helix forming residues include M, A, K, L, D, E, R, Q, F, I, and V. 
[0083] In a preferred embodiment, the bias is toward peptides that interact with the 
known classes of molecules. For example, it is known that SH-3 peptides binds to SH-3 
proteins. A large number of small molecule domains are known that are suitable as 

25 starting points for the generation of biased randomized peptides. Examples of such 
molecules, domains, or consensus sequences include, but are not limited to SH-2 
domains, SH-3 domains, pleckstrin, death domains, protease cleavage/recognition sites, 
enzyme inhibitors, enzyme substrates, and Traf., and leucine zipper consensus sequence. 
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[0084] As discussed above, a fusion partner or linker can be added to fuse the random 
peptides to a GFP peptide. Fusion partners or linkers can be synthetic or heterologous 
(not native to the host cell). Appropriate fusion partners include, but are not limited to 
peptides that are stability sequences that stabilize and protect the random peptide from 
5 degradation, linker sequences for decoupling the random peptide from the GFP fragment, 
structural sequences that restrict and stabilize the conformation of the random peptide, 
targeting sequences which allow localization of the peptide into a subcellular or 
extracellular compartment, and rescue sequences that allow the purification or isolation of 
the random peptide. 

10 [0085] In light of the foregoing general discussion, the specific examples presented 

below are illustrative only and are not intended to limit the scope of the invention. Other 
generic and specific configurations will be apparent to those persons skilled in the art. 

EXAMPLES 

General Materials and Methods 
[0086] GFP Variant: A variant of the naturally occurring GFP, which has a single 
excitation maximum at 475 run was chosen for dissection and reassembly. The GFP 
variant (sglOO) contains F64L, S65C, Q80R, Y151L, I167T and K238N mutations from 
wild type GFP, which leads to a single fluorescence excitation and emission maximum at 
475 nm and 505 nm respectively, similar to GFP-sg25 as described by Palm, G. J et al, 
1997, Nat. Struct Biol., 4, 361-365. 

[0087] Cloning and Purification Protocol for NZGFP, NGFP, CGFP, and CZGFP: 
The NZGFP, NGFP, CGFP and CZGFP coding DNA were obtained by PCR 
25 amplification of the GFP (sglOO) plasmid template using appropriate primers. The DNA 
fragments were cut with Nhel/BamHl and ligated into the pETl la vector. The DNA 
sequences of the NZGFP, NGFP, CGFP, and CZGFP containing clones were verified by 
dideoxyoligonucleotide sequencing at the Keck facility at Yale. The protein products 
were overexpressed in BL21(DE3) cells at 37 °C without IPTG induction. The cells were 
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lysed by sonication and the proteins were individually purified by passage over 2 
successive Q-sepharose columns and then over a Gel-filtration column. Fractions 
containing the protein of interest, as determined by SDS-PAGE, were pooled and 
dialyzed against 2mM DTT, 10 mM TrisHCl buffer at pH 7.2. Final purified yields of 
5 proteins were between 10-20 mg/ L. Protein molecular weights were verified by MALDI 
mass spectrometry to within 0.05% of the calculated molecular weight. Amino acid 
analysis of the proteins established the correct compositions and protein concentrations 
for further biophysical studies. 

10 [0088] Amino acid sequences ofNGFP, NZGFP, CGFP, and CZGFP: Leucine zippers 
are in bold and linker regions underlined. Note the 6 residue linker between the C- 
terminal ofNGFP and NZ and the 4 residue linker between CGFP and CZ. 

NGFP 

15 [0089] MASKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFIC 
TTGKLPVPWPTLVTTLCYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTIFFKD 
DGNYKTRAEVKFEGDTLVNPJELKGIDFKEDGMLGHKLEYNYNHNVLIMADKQ 
GGSGSG (SEP ID NO: 3) 

20 NZGFP 

[0090] MASKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFIC 
TTGKLPVPWPTLVTTLCYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTIFFKD 
DGNYKTRAEVKTEGDTLVNRIELKGIDFKEDGMLGHKLEYNYNHNVLIMADKQ 
GG5GSGALKKELQANKKELAQLKWELQALKKELAQ (SEQ ID NO: 4) 

25 

CGFP 

[0091] IVIASGOS^CNGIKVNFKTRimiEDGSVQLADHYQQNTPIGDGPVLLPDNH 
YLSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYN (SEQ ID NO: 5) 
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[0092] CZGFP 

MASEQLEKKI,OALEKKLAOLEWKNOALEKKLAOGG^GK^GIKVNFKTRHNI 
EDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKJIDHMVLLEFVT 
AAGITHGMDELYN (SEQ ID NO: 6) 

[0093] Additional Constructs for GFP Reassembly: DNA constructs for EE-NGFP and 
KK-CZGFP coding DNA were obtained by PCR amplification of the GFP (sglOO) 
plasmid template using appropriate primers encoding the leucine zippers KK and EE 
whose sequences are AQLKEKLQALKEKLAQK WKLNALKEKLAQ (SEQ ID NO: 7) 
and ALEKELQANEKELAQLEWELQALEKELAQ (SEQ ID NO: 8) respectively. The 
DNA fragments were digested with Nhel/BamHl (New England Biolabs) and ligated into 
the pETl la vector. The DNA sequences of the EE-NGFP, and KK-CGFP containing 
clones were verified by automated sequencing at the Keck facility at Yale. 

15 [0094] Constructs for Library Selection: For leucine zipper library construction two 
overlapping degenerate oligonucleotides containing NAG (N= G or A) at all positions 
corresponding to Lys in the leucine zipper of KK-CGFP were synthesized such that they 
would code for either Lys or Glu with equal probability. The two overlapping 
oligonucleotides were mutually primed and extended using T7 Sequenase (Amersham) 

20 with 10 mM dNTPs. The product was purified from an agarose gel and subsequently 
ligated into the Nhel-Dralll (New England Biolabs) cassette present in a previously cut 
KK-CGFP plasmid. The resulting library, XX-CGFP, was transformed in 5 x 50 \iL of 
electrocompetent XL 1 -Blue cells (Stratagene) and selected for ampicillin resistance. The 
resulting pool of XX-CGFP plasmids was sequenced to verify that G/A were equally 

25 represented at sites of randomization. 

[0095] Colorometric Selection: For all reassembly experiments with NZGFP/CZGFP, 
NZGFP/KK-CGFP, EE-NGFP/CZGFP, and EE-NGFP/KK-CGFP: 1 ^g of each plasmid 
was cotransformed in 30 of BL21(DE3) cells and selected on ampicillin containing 

l-WA/l 611 537.1 



5 



10 



44574-5100-US 



29 



LB plates. The plates were incubated at 37 °C overnight and subsequently moved to the 
bench top (23 °C) for 2 day. The green color developed after 16-32 hours. The 
cotransformation efficiency was approximately 7 + 2 % as verified by growing up 
individual colonies and monitoring protein expression profiles, which corresponded well 

5 with visual inspection of green colonies in experiments with NZGFP/CZGFP and EE- 
NGFP/KK-NGFP. Non-fluorescent colonies that coexpressed either NZGFP/KK-CGFP 
or EE-NGFP/CZGFP were identified by screening 120 colonies of respective 
cotransformations by SDS gel for protein expression of both gene products. In library 
selections, 20 individual cotransformations of l|ig of XX-CGFP library plasmid with 1 

10 jag of EE-NGFP plasmid were carried out as described above. Sixteen colonies were 
selected from 102 green colonies of -4000 total colonies. The colonies were grown 
overnight in LB media and the plasmid DNA (XX-CGFP + EE-NGFP) purified and 
sequenced using primers unique to the XX-CGFP construct. 

Inhibition of Protein-Protein Interactions: The protein products for NZGFP, CZGFP, 
15 NGFP and CGFP were overexpressed in BL21(DE3) cells at 37 °C and purified as 

described above. Amino acid analysis of the proteins established the correct compositions 
and protein concentrations for fluorescence experiments. The inhibitor peptide 
corresponding to the leucine zipper of NZGFP (EK peptide) having the sequence 
ALKKJELQANKKELAQLKWELQALKKELAQ (SEQ ID NO: 1) was synthesized at the 
20 Keck facility (Yale University) and purified on a reverse phase C8 column(Vydac) by 

HPLC. Peptide concentrations were determined by Trp absorbance and verified by amino 
acid analysis. 

[0096J For inhibition experiments all fluorescence measurements were made in 
triplicate on a Hitachi F-4500 Fluorescence Spectrophotometer with excitation at 475 nm 
25 and emission at 505 nm. A 1 .2 mM stock solution of equimolar amounts of 

NZGFP/CZGFP was allowed to reassemble and fluoresce until there was no change in 
fluorescence (36 hours). The reassembled complex was denatured in 4 M GdmHCl for 4 
hours following which different concentrations of EK peptide, NGFP, or CGFP were 
added and the NZGFP/CZGFP concentration adjusted to 800 ^M. The samples were 
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diluted 200 fold into 2 mM DTT, 10 mM Tris.HCl buffer at pH 7.2 to a 20 mM final 
concentration of Gdm.HCl and allowed to refold and fluoresce. In order to eliminate 
artifacts from time dependent inhibition, fluorescence measurements were made after 4 
hours and after 16 hours and found to be constant. 

5 

Example 1: In Vitro Reassembly of GFP Using an Antiparallel Leucine 

Zipper 

1 . Design of Antiparallel Leucine Zipper 
[0097] Designs for helices, designated NZ and CZ, to form antiparallel leucine zippers 

10 for reassembly purposes were based upon sequences reported by Hodges, (11a) Kim, 
(Oshea, E. K. et aL, 1993, Current Biol, 3, 658-667) and Alber (Harbury, P. B. et aL, 
1994, Nature, 371, 80-84). The leucine zippers contained a Leu-rich hydrophobic core, 
acidic (Glu) and basic (Lys) residues to direct antiparallel heterodimer formation, and 
also incorporated a buried asparagine residue which disfavors homodimerization by up to 

15 2.3 Kcal/mol (Figure 1) (Oakley M. G. et aL; 1998, Biochemistry, 37, 12603-12610). 

2. Dissection of GFP 

[0098] The variant GFP (sglOO) was dissected at a surface loop between residues 157 
and 158, a position that has previously been shown to accommodate a 20 residue amino 
20 acid insertion ( AbcdUM \ -et aL , 1 998, Nucleic Acid ** ^ 6 23-630). The dissection resu lted in N- 
and C-terminal fragments, designated NGFP and CGFP, containing 157 and 81 residues, 
respectively (Figure 1). The NGFP fragment contains the three residues, Ser65, Tyr66, 
and Gly67, that ultimately form the GFP fluorophore (Tsien, R. Y., 1998, Annu. Rev. 
Biochem., 67, 509-544). 

25 

3. In Vitro Reassembly of the Dissected GFP Fragments Using the Designed 

Helices 

[0099] The designed helix, NZ was appended to the C-terminal of NGFP, via a six 
residue linker, to generate the fusion peptide designated NZGFP. Similarly, CZ was 
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appended to the N-terminal residue of CGFP, via a four residue linker, to generate the 
complementary fusion peptide, CZGFP. 

[0100] It was envisioned that if NZGFP and CZGFP were competent to heterodimerize 
via the designed helices, either in vitro or in vivo, the reconstituted GFP protein would 
5 display its characteristic fluorescence, indicating the correct reassembly of the tertiary 
fold from the peptide fragments. The genes encoding the designed protein sequences 
NZGFP, CZGFP, NGFP, and CGFP were cloned and the resulting proteins overexpressed 
and purified using methods routinely practiced by the skilled artisan. 
[0101] To investigate the viability of the protein reassembly strategy, a literature 

10 protocol devised for the refolding of denatured GFP was followed (Reid, B. G. et al, 
1997, Biochemistry, 36, 6786-6791). Thus, equimolar amounts (4 :M) of the fragments, 
NZGFP and CZGFP, were denatured in 6 M GdmHCl and dialyzed into a buffer 
containing 2 mM DTT, 10 mM phosphate buffer at pH 7.2 over 24 hrs at 4 °C. The 
reassembled peptides were visibly green. Moreover the 8max for the fluorescence 

15 excitation and emission spectra were identical to that of the parent GFP (Figure 2 inset). 
To verify that the reassembly was indeed guided by the antiparallel leucine zippers, 
control experiments were done with fragments with and without the leucine zippers. It 
was found that solutions containing NGFP, CGFP, NGFP/CGFP, NZGFP/CGFP, or 
NGFP/CZGFP did not fluoresce, even at concentrations of over 100 :M. The apparent 

20 dissociation constant, K^pp, for the NZGFP/CZGFP complex was determined by titrating 
NZGFP into a solution of CZGFP and monitoring the fluorescence emission intensity at 
505 nm (Figure 2). The data were fitted to a two-state binding isotherm, yielding a K^p 
of 3 1 ± 7 nM and V-analysis of the binding data verified the expected 1 : 1 stoichiometry 
of NZGFP and CZGFP (Bagshaw, C. R.; et al, 1987, Spectrophotometry and 

25 spectrofluorimetry: A practical approach, pp 91-1 13). 

Example 2: In Vivo Reassembly of the Dissected GFP Fragments 
[0102] BL21(DE3) E. coli cells were transformed with equimolar amounts of NZGFP 
and CZGFP encoding plasmids. The appearance of green color was monitored to identify 
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cotransformed colonies expressing reassembled GFP. After 36 hours several of the 
colonies turned green as illustrated in Figure 3a. with a cotransformation efficiency of 
4%. Individual colonies were cultured in liquid media and their protein expression 
pattern analyzed. The green colonies were shown to express similar amounts of NZGFP 
5 and CZGFP (Figure 3b and 3c) 5 whereas non-fluorescent colonies were shown to contain 
either NZGFP or CZGFP. Furthermore, control cotransformation experiments with 
NGFP/CGFP, NGFP/CZGFP and NZGFP/CGFP failed to show any green colonies, thus 
emphasizing the requirement for the presence of both NZ and CZ leucine zippers to 
mediate GFP assembly in vivo and in vitro. 

10 

Example 3: Reassembly of Proteins Using Antiparallel Leucine Zipper 
[0103] The methods described above for reassembly of GFP in vivo and in vitro may be 
modified for reassembly of any protein of interest, using antiparallel leucine zippers. The 
particular proteins are not critical, so long as they can be divided into fragments that 
15 produce a detectable signal upon their association, specific binding, or complexation 
mediated by the formation of an antiparallel zipper with a known biological activity or 
function that can be assayed for in vitro or in vivo, for example, kinase activity for a 
protein kinase, proteolytic activity for a protease, and DNA binding activity of DNA 
binding protein. 

20 [0104] The peptide fragments of the protein of interest are fused to each of the helices 
(SEQ ID NO: 1 and SEQ ID NO: 2) described above. Alternatively, other pairs of helices 
that form antiparallel leucine zippers may be designed and fused to the peptide fragments 
of the protein of interest. 

25 1 . In Vitro Reassembly 

[0105] Equimolar amounts of the fusion peptides comprising peptides of the protein of 
interest and helices that form antiparallel leucine zippers are denatured and dialyzed as 
described in Example 1 . The reconstitution of the protein is monitored. 
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2. In Vivo Reassembly 
[0106] Equimolar amounts of plasmids encoding the fusion peptides are transformed in 
host eucaryotic or procaryotic host cells as described in Example 2. The cotransformed 
colonies expressing reassembled protein are identified. 

5 

Example 4: The GRIP Assay 
[0107] In order to test the specificity of the GRIP assay, a set of four possible LZ 
combinations which were either electrostatically matched (EE-NGFP/KK-CGFP and 
NZGFP/CZGFP) or mismatched (EE-NGFP/CZGFP and NZGFP/KK-CGFP) (Fig. 4; 
10 Bryson, et al, 1995, Science 270, 935-941, Oakley et al, 1998, Biochemistry 37, 12603- 
12610) were designed. The GRIP assay would only allow the matched pairs to 
competently fold and catalyze fluorophore formation in GFP. 

[0108] Escherichia coli (BL21) cells were cotransformed with plasmids encoding the 
proteins of interest and plated on ampicillin containing plates. Fluorescent colonies were 

15 observed only in the complementary pairs (EE-NGFP/KK-CGFP and NZGFP/CZGFP). 
No visible fluorescence was observed in colonies containing the uncomplementary pairs 
(EE-NZGFP/CZGFP and NZGFP/KK-CGFP). Since the electrostatically mismatched 
pairs have a dissociation constant, IQ, of ~ 100 (iM (Yao, S., et al, 1998, Nature 396, 
447-450), this experiment sets an initial lower visual limit for detecting protein-protein 

20 interactions using the GRIP assay. 

Example 5: GRIP Assay and Combinatorial Selection 
[0109] A library of LZ partners that could either code for Glu or Lys with equal 
probability at the e and g "specificity" positions (Fig. 5A) was generated (Oakley, M. G. 
25 et al, 1998, Biochemistry 37, 12603-12610; Pelletier, J.N. et al, 1999, Nat. BiotechnoL 
17, 683-690; Dmitry, K., et al, 1994, EMBOJ. 13, 2849-2861). This library was fused to 
the C-terminal GFP fragment (XX-CGFP). It is thought that the selected partners would 
be enriched in Lys in order to complement the acidic EE-NGFP. 
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[0110] The 256-member plasmid-encoded library of XX-CGFP was co transformed with 
EE-NGFP and selected colonies that exhibited fluorescence. The protein expression 
profiles of the two protein fragments, XX-CGP and EE-NGFP, were virtually identical in 
cotransformed cells (Fig. 5B), thus excluding differences in relative protein concentration 
5 as a major determinant of the observed fluorescence. Sixteen of the multiple colonies 
exhibiting fluorescence were sequenced. The results of the selection are summarized in 
Figure 5C. The selected LZ partners of EE-NGFP displayed an overall 3:1 ratio of 
Lys:Glu residues, with the fewest Lys residues being 5 and the most being 7. Thus, an 
overall enrichment of Lys residues was observed as predicted from the requirement for 

10 electrostatic pairing of Lys/Glu for stabilizing the leucine zipper. Assuming an average 
value of 0.85 kcal/mol penalty for each Glu-Glu pair relative to Lys-Glu pair based on 
literature precedence, (Dmitry, K., etal, 1994, EMBOJ. 13, 2849-2861; Zhou, N. E. et 
aL, 1994, Protein Eng. 7, 1365-1372), a 75 fold difference in between the best with all 
Lys (IQ = 33 nM) (Ghosh, I. et al. 9 2000, /. Am. Chem. Soc. 122, 5658-5659) and worst 

15 (5 Lys and 3 Glu) leucine zipper partners for EE-NGFP was estimated. Thus, this 

experiment sets a lower threshold for the visual detection of interacting proteins in the 
GRIP assay to ~ 2.5 ]LiM, which is within the observed dissociation constant for most 
specific protein partners. 

20 Example 6: Inhibition of Protein-Protein Interactions 

[0111] To verify that the GRIP assay could be utilized for detecting inhibitors of 
protein-protein interactions (Zutshi, R., etal, 1998, Curr. Opin. Chem. Biol 2, 62-66), a 
LZ peptide corresponding to the LZ present in NZGFP (Fig. 4) was synthesized. The 
reappearance of fluorescence of the disassembled NZGFP/CZGFP complex in an in vitro 

25 assay as a function of added peptide was monitored (Fig. 6). The EK peptide (SEQ ID 
NO: 1) prevented the assembly of the complex (4 ^M) with an IC 5 o value of 3 1 |iM. 
Control experiments with addition of either NGFP or CGFP fragments that lacked leucine 
zippers did not prevent reassembly of NZGFP/CZGFP complex (Fig. 6). It is worth 
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noting that disassembly of an existing GFP complex was not achievable even at > 1 mM 
added peptide inhibitor. 

Example 7: GRIP Assay and Identification of Binding Partners Via 
5 Combinatorial Selection 

[0112] The GRIP assay may be modified by substituting the helices that form 
antiparallel leucine zippers with test proteins or peptides to determine whether a test 
protein or peptide attached to one portion of GFP interacts with another test protein or 
peptide attached to the other portion of GFP. The test proteins can be any protein. As an 
10 example, an orphan receptor can be fused to one portion of the GFP, while test ligands 
can be fused to the second portion of GFP. 

[0113] Specifically, nucleic acid encoding a fusion protein comprising an orphan 
receptor and a first portion of the GFP and a plasmid library of fusion proteins comprising 
test ligands and the second portion of GFP can be cotransfected or cotransformed into 

15 host cells. Colonies exhibiting fluorescence are selected, since they contain GFP 

molecules that have been properly folded or reassembled, and test ligands that interact 
with the orphan receptor. The colonies can be further cultured and investigated to 
determine the structural properties of the ligand. The molecular weight of the ligand may 
be determined by SDS-PAGE, and the primary structure may be determined by amino 

20 acid sequencing. 

[0114] Examples of orphan receptor groups include but are not limited to CCRL2, 
CMKLR1, CMKRL2, GPR31, HM74, and RDC1. Specific examples of orphan receptors 
of each group include but are not limited to: 1) CCRL2: chemokine (C-C motif) 
receptor-like 2, HCR, CRAM-B, CKRX, CRAM- A, lipopolysaccharide inducible C-C 

25 chemokine receptor related, E01; 2) CMKLR1: chemokine-like receptor 1, ChemR23, 
CMKRL3, DEZ, CMKLR1, LOC60669: G-protein coupled chemoattractant-like 
receptor; 3) CMKRL2: chemokine receptor-like 2, CMKRL2, FEG-1, GPCR-BR, 
DRY 1 2, CEPR, GPR30, GPR4 1 ; 4) GPR3 1 : G protein-coupled receptor 3 1 , GPR3 1 , 
Gpr31b; 5) HM74: putative chemokine receptor, GTP-binding protein; and 6) RDC1: 
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chemokine orphan receptor, D2S87E, GPRN1, CMKOR1, canine orphan receptor RDC1 
homolog, chemokine orphan receptor 1, Rdcl. 

[0115] The GRIP assay may be modified to detect macromolecular interactions, for 
example, specific protein-protein interactions, both in vitro and in vivo. When the 
5 proteins attached to the two GFP fragments associate with each other, the two GFP 
fragments will properly reassemble and fluoresce. Thus, in the absence of association, 
the proteins attached to the GFP fragments do not fluoresce. Fluorescence of interacting 
protein pairs linked to NGFP and CGFP can provide a sensitive assay for detecting the 
affinity and specificity of the individual protein pairs (and their mutants) under 
10 investigation. 

[0116] Examples of protein-protein interactions include, but are not limited to, 
antigen/antibody, ligand/receptor, antagonist or inhibitor/protein, binding protein/protein, 
and enzyme/substrate. Specific protein-protein interactions involved in disease and 
identified as potential drug targets include examples such as Bax/Bcl-2 (Sartorius, et al, 
15 2001, Chembiochem, 2 (1), 20), p53/mdm2 (Moll et al, 2000, Drug Resist Update, 3 (4), 
217)), VEGF/VEGF-R (Plate et al, 1992, Nature, 359, 845), IL-6/IL-6R (Akira et al, 
1993, Adv. Immunol, 54, 1), Ras/Raf (Weinstein-Oppenheimer et al, 2000, Pharmacol 
77ier.,88(3), 229). 

[0117] Examples of other macromolecular interactions include, but are not limited to, 
20 nucleic acid-nucleic acid binding protein interactions and carbohydrate-protein 
interactions, 

Example 8: GRIP Assay and Identification of Inhibitors Via Combinatorial 
Selection 

25 [0118] The GRIP assay may be modified to identify inhibitors of a specific protein- 
protein interaction. For example, a receptor can be fused to a portion of GFP, while a 
ligand can be fused to a second portion of GFP. A test inhibitor, such as a test antagonist, 
can be incubated with the two GFP fusion proteins comprising the ligand and receptor to 
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see if it prevents the reassembly of GFP which can be detected by the loss of 
fluorescence. 

[0119] Specifically, nucleic acid encoding a fusion protein comprising a known receptor 
and a first portion of the GFP, nucleic acid encoding a fusion protein comprising its 

5 known ligand and the second portion of GFP, and a plasmid library of test antagonists can 
be cotransfected or cotransformed into host cells. Colonies that do not exhibit 
fluorescence are selected, since they contain GFP molecules that have been prevented 
from folding or reassembly and test antagonists that inhibit the interaction of the known 
receptor with its ligand. The colonies can be further cultured and investigated to 

10 determine the structural properties of the ligand. The molecular weight of the ligand may 
be determined by SDS-PAGE, and the primary structure may be determined by amino 
acid sequencing. 

[0120] It should be understood that the foregoing discussion and examples merely 
present a detailed description of certain preferred embodiments. It therefore should be 
15 apparent to those of ordinary skill in the art that various modifications and equivalents 
can be made without departing from the spirit and scope of the invention. All journal 
articles, other references, patents, and patent applications that are identified in this patent 
application are incorporated by reference in their entirety. 
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